Many of the figures and analysis methodology were based off of this tutorial from NYU. https://learn.gencore.bio.nyu.edu/rna-seq-analysis/gene-set-enrichment-analysis/

Inputs Received

Information received from the csv samplesheet:

Validating input parameters…

## ✓ Sample 1 ('chen_phillips') parameters validated
## ✓ Sample 2 ('akay') parameters validated
## ✓ Sample 3 ('weiser') parameters validated

All parameters validated successfully!

1. Chen & Phillips Analysis

Metadata information received from the tsv table:

##   replicate strain
## 1      rep1  hrde1
## 2      rep2  hrde1
## 3      rep3  hrde1
## 4      rep1  hrde2
## 5      rep2  hrde2
## 6      rep3  hrde2
## 7      rep1     wt
## 8      rep2     wt
## 9      rep3     wt

Analysis Methods

Here, count data from the RNA-seq experiment is read in the form of a counts matrix. Each column holds data from one sample, and each row represents a gene, such that the i-th row and n-th column tells how reads of gene i were measured in sample n. The values received should be un-normalized counts of sequencing reads or fragments.

# # Read Count Matrix
counts_file <- trimws(sample_params$salmon_merged_gene_counts_file_path)
count.matrix <- read_tsv(counts_file)
count.matrix <- count.matrix %>%
  mutate(across(3:ncol(.), ~ as.integer(.x))) %>%
  column_to_rownames("gene_id")

gene.reference <- dplyr::select(count.matrix, 1)

count.matrix <- count.matrix %>%
  dplyr::select(-1)

Information entered as metadata describes the samples (columns) of the count matrix. This metadata is combined with the sample/column names from the count matrix so the accuracy of the metadata can be reviewed. Take time to review the table below.

# Design ColData Matrix
sample <- colnames(count.matrix)
colData <- data.frame(sample)

# Add Columns for Condition to colData
for (i in colnames(metadata)){
  colData[i] <- metadata[i]
}

colData <- column_to_rownames(colData, "sample")
##            replicate strain 
## hrde1_rep1 "rep1"    "hrde1"
## hrde1_rep2 "rep2"    "hrde1"
## hrde1_rep3 "rep3"    "hrde1"
## hrde2_rep1 "rep1"    "hrde2"
## hrde2_rep2 "rep2"    "hrde2"
## hrde2_rep3 "rep3"    "hrde2"
## wt_rep1    "rep1"    "wt"   
## wt_rep2    "rep2"    "wt"   
## wt_rep3    "rep3"    "wt"

Using the count matrix, column metadata, and user-inputed design formula (expressing the variables to be used in modeling), a DESeq data set is created.

The experimental variable and the associated control/experimental conditions are given by the user in “ref_level_cond” and “ref_level_value” in the sampleInfo file respectively.

Pre-filtering is performed, keeping only genes that have a count of at least 10 in a minimum number of samples. The minimum number of samples is decided by calculating the number of times the reference level value (control condition) is listed in the metadata with the assumption that experimental conditions will be repeated the same number of times.

Standard differential expression analysis is performed with the DESeq function, and regularized log transformation (rlog) transforms count data to a log2 scale for PCA.

# Set Reference Level
reflevelcond <- trimws(sample_params$ref_level_cond)
reflevelvalue <- trimws(sample_params$ref_level_value)
# Run DESeq2 analysis
deseq_result <- runDESeq2Analysis(
  count.matrix = count.matrix,
  colData = colData,
  design = sample_params$design,
  ref_level_cond = reflevelcond,
  ref_level_value = reflevelvalue,
  metadata = metadata
)

dds <- deseq_result$dds
rld <- deseq_result$rld

If input data is designated to be batch corrected, the ComBat_seq() function from the SVA Bioconductor package is used. The batch (condition) to be regressed out is given in sampleInfo.csv as “batch_cond”, and the sample information for this condition is taken from the metadata.

The group argument of ComBat_seq() specifies biological covariates, whos signals should be preserved in adjusted data. All conditions (columns) from the metadata which are not to be regressed out are passed to the group argument. Differential expression analysis is otherwise performed as described above on batch-corrected counts.

if (as.logical(trimws(sample_params$batch_correct))) {
  # Replicate / Batch Correction
  batch <- trimws(sample_params$batch_cond)

  batch.correct <- ComBat_seq(
    counts = as.matrix(count.matrix),
    batch = metadata[[batch]],
    group = metadata[[colnames(metadata)[colnames(metadata) != batch]]]) %>%
    as.data.frame()

  # Run DESeq2 analysis on batch-corrected counts
  batch_deseq_result <- runDESeq2Analysis(
    count.matrix = batch.correct,
    colData = colData,
    design = sample_params$design,
    ref_level_cond = reflevelcond,
    ref_level_value = reflevelvalue,
    metadata = metadata
  )

  batch.correct.dds <- batch_deseq_result$dds
  batch.correct.rld <- batch_deseq_result$rld
  
  # Cleanup
  rm(batch, batch.correct, keep)
}

Reference level condition and value (ref_level_cond and ref_level_value respectively) inputs from sampleInfo.csv are used to create contrast terms to compare treatment samples with control samples. This relies on the design formula being formatted correctly to produce comparisons relevant to the experimental condition.

# # Find relevant contrasts
contrasts <- list()

# Factor values of experimental condition
pattern <- factor(metadata[[reflevelcond]])
comp <- levels(pattern)

# Remove control state from experimental condition vector
comp <- comp[comp != reflevelvalue]

# Create Contrasts
for (i in 1:length(comp)){
  contrasts[[i]] <- c(reflevelcond, comp[i], reflevelvalue)
}

Based on the number of experimental conditions (identified in the ref_level_cond column of the metadata) and the contrast terms comparing them to the experimental control (ref_level_value), differential expression results are extracted from the DESeq data set object.

# Find results from relevant contrasts
results <- list()

# Pull results from DESeq object with contrasts
for (i in 1:length(contrasts)){
  results[[i]] <- list()
  results[[i]][["DataFrame"]] <- results(dds.for.analysis, contrast = contrasts[[i]]) %>%
    data.frame() %>%
    rownames_to_column("gene_id") %>%
    mutate(gene_name = gene.reference[gene_id, 1], .before=baseMean) %>% # Add gene names to DESeq results matrices
    column_to_rownames("gene_id")
  
  results[[i]][["DESeqDataSet"]] <- results(dds.for.analysis, contrast = contrasts[[i]])
}

Principle Component Analysis

The Bioconductor package PCAtools is used to perform principle component analysis on regularized log transformed count data. Scree plots show principle component numbers on the x-axis, and their respective eigenvalues on the y-axis.

The third graph plots experimental metadata against a number of PCs and their gene loadings to visualize the agreement with the gene expression pattern of each condition for each PC.

No Regression

Replicate Regressed Out

strain_hrde1_vs_wt

DEG Report

Ontology Report

Transfering WORMBASE gene IDs to ENTREZID:

308 gene IDs were not transfered from WORMBASE to ENTREZID 
12108 gene IDs were successfully transfered from WORMBASE to ENTREZID 

Number of highly variable genes identified (with | L2FC | > 0.8 and padj < 0.1) is:

495

Number of highly upregulated genes identified (with L2FC > 0.8 and padj < 0.1) is:

377

Number of highly downregulated genes identified (with L2FC < -0.8 and padj < 0.1) is:

118

Dysregulated

Gene Ontology GSEA
protein phosphorylation

phosphorylation

dephosphorylation

ribonucleoprotein complex biogenesis

tRNA metabolic process

collagen and cuticulin-based cuticle development

mitochondrial translation

rRNA processing

male gamete generation

mitochondrial gene expression

RNA modification

mitochondrial respiratory chain complex assembly

cellular respiration

immune response

defense response to symbiont

immune system process

establishment of protein localization to membrane

developmental process involved in reproduction

neuropeptide signaling pathway

mitochondrial electron transport, NADH to ubiquinone

electron transport chain

cell fate commitment

Reactome Pathway GSEA
Mitochondrial translation termination

Translation

Mitochondrial translation elongation

Mitochondrial translation

SRP-dependent cotranslational protein targeting to membrane

Formation of a pool of free 40S subunits

GTP hydrolysis and joining of the 60S ribosomal subunit

Nonsense Mediated Decay (NMD) independent of the Exon Junction Complex (EJC)

Physiological factors

Respiratory electron transport

Metabolism of RNA

Upregulated

GO Over-representation
Reactome Pathway Over-representation

Downregulated

GO Over-representation

Reactome Pathway Over-representation

Table Reactome Pathway Over-representation Analysis of Downregulated Genes has no rows.

strain_hrde2_vs_wt

DEG Report

Ontology Report

Transfering WORMBASE gene IDs to ENTREZID:

308 gene IDs were not transfered from WORMBASE to ENTREZID 
12108 gene IDs were successfully transfered from WORMBASE to ENTREZID 

Number of highly variable genes identified (with | L2FC | > 0.8 and padj < 0.1) is:

1557

Number of highly upregulated genes identified (with L2FC > 0.8 and padj < 0.1) is:

1051

Number of highly downregulated genes identified (with L2FC < -0.8 and padj < 0.1) is:

506

Dysregulated

Gene Ontology GSEA
dephosphorylation

defense response to bacterium

protein phosphorylation

phosphorylation

male gamete generation

neuropeptide signaling pathway

sexual reproduction

mitochondrial transmembrane transport

Reactome Pathway GSEA

Table Reactome Pathway GSE Of All Differentially Expressed Genes has no rows.

Upregulated

GO Over-representation
Reactome Pathway Over-representation

Downregulated

GO Over-representation

Reactome Pathway Over-representation

2. Akay Analysis

Metadata information received from the tsv table:

##   strain replicate
## 1  hrde1      rep1
## 2  hrde1      rep2
## 3  hrde1      rep3
## 4     wt      rep1
## 5     wt      rep2
## 6     wt      rep3

Analysis Methods

Here, count data from the RNA-seq experiment is read in the form of a counts matrix. Each column holds data from one sample, and each row represents a gene, such that the i-th row and n-th column tells how reads of gene i were measured in sample n. The values received should be un-normalized counts of sequencing reads or fragments.

# # Read Count Matrix
counts_file <- trimws(sample_params$salmon_merged_gene_counts_file_path)
count.matrix <- read_tsv(counts_file)
count.matrix <- count.matrix %>%
  mutate(across(3:ncol(.), ~ as.integer(.x))) %>%
  column_to_rownames("gene_id")

gene.reference <- dplyr::select(count.matrix, 1)

count.matrix <- count.matrix %>%
  dplyr::select(-1)

Information entered as metadata describes the samples (columns) of the count matrix. This metadata is combined with the sample/column names from the count matrix so the accuracy of the metadata can be reviewed. Take time to review the table below.

# Design ColData Matrix
sample <- colnames(count.matrix)
colData <- data.frame(sample)

# Add Columns for Condition to colData
for (i in colnames(metadata)){
  colData[i] <- metadata[i]
}

colData <- column_to_rownames(colData, "sample")
##             strain  replicate
## hrde1_rep_1 "hrde1" "rep1"   
## hrde1_rep_2 "hrde1" "rep2"   
## hrde1_rep_3 "hrde1" "rep3"   
## wt_rep_1    "wt"    "rep1"   
## wt_rep_2    "wt"    "rep2"   
## wt_rep_3    "wt"    "rep3"

Using the count matrix, column metadata, and user-inputed design formula (expressing the variables to be used in modeling), a DESeq data set is created.

The experimental variable and the associated control/experimental conditions are given by the user in “ref_level_cond” and “ref_level_value” in the sampleInfo file respectively.

Pre-filtering is performed, keeping only genes that have a count of at least 10 in a minimum number of samples. The minimum number of samples is decided by calculating the number of times the reference level value (control condition) is listed in the metadata with the assumption that experimental conditions will be repeated the same number of times.

Standard differential expression analysis is performed with the DESeq function, and regularized log transformation (rlog) transforms count data to a log2 scale for PCA.

# Set Reference Level
reflevelcond <- trimws(sample_params$ref_level_cond)
reflevelvalue <- trimws(sample_params$ref_level_value)
# Run DESeq2 analysis
deseq_result <- runDESeq2Analysis(
  count.matrix = count.matrix,
  colData = colData,
  design = sample_params$design,
  ref_level_cond = reflevelcond,
  ref_level_value = reflevelvalue,
  metadata = metadata
)

dds <- deseq_result$dds
rld <- deseq_result$rld

If input data is designated to be batch corrected, the ComBat_seq() function from the SVA Bioconductor package is used. The batch (condition) to be regressed out is given in sampleInfo.csv as “batch_cond”, and the sample information for this condition is taken from the metadata.

The group argument of ComBat_seq() specifies biological covariates, whos signals should be preserved in adjusted data. All conditions (columns) from the metadata which are not to be regressed out are passed to the group argument. Differential expression analysis is otherwise performed as described above on batch-corrected counts.

if (as.logical(trimws(sample_params$batch_correct))) {
  # Replicate / Batch Correction
  batch <- trimws(sample_params$batch_cond)

  batch.correct <- ComBat_seq(
    counts = as.matrix(count.matrix),
    batch = metadata[[batch]],
    group = metadata[[colnames(metadata)[colnames(metadata) != batch]]]) %>%
    as.data.frame()

  # Run DESeq2 analysis on batch-corrected counts
  batch_deseq_result <- runDESeq2Analysis(
    count.matrix = batch.correct,
    colData = colData,
    design = sample_params$design,
    ref_level_cond = reflevelcond,
    ref_level_value = reflevelvalue,
    metadata = metadata
  )

  batch.correct.dds <- batch_deseq_result$dds
  batch.correct.rld <- batch_deseq_result$rld
  
  # Cleanup
  rm(batch, batch.correct, keep)
}

Reference level condition and value (ref_level_cond and ref_level_value respectively) inputs from sampleInfo.csv are used to create contrast terms to compare treatment samples with control samples. This relies on the design formula being formatted correctly to produce comparisons relevant to the experimental condition.

# # Find relevant contrasts
contrasts <- list()

# Factor values of experimental condition
pattern <- factor(metadata[[reflevelcond]])
comp <- levels(pattern)

# Remove control state from experimental condition vector
comp <- comp[comp != reflevelvalue]

# Create Contrasts
for (i in 1:length(comp)){
  contrasts[[i]] <- c(reflevelcond, comp[i], reflevelvalue)
}

Based on the number of experimental conditions (identified in the ref_level_cond column of the metadata) and the contrast terms comparing them to the experimental control (ref_level_value), differential expression results are extracted from the DESeq data set object.

# Find results from relevant contrasts
results <- list()

# Pull results from DESeq object with contrasts
for (i in 1:length(contrasts)){
  results[[i]] <- list()
  results[[i]][["DataFrame"]] <- results(dds.for.analysis, contrast = contrasts[[i]]) %>%
    data.frame() %>%
    rownames_to_column("gene_id") %>%
    mutate(gene_name = gene.reference[gene_id, 1], .before=baseMean) %>% # Add gene names to DESeq results matrices
    column_to_rownames("gene_id")
  
  results[[i]][["DESeqDataSet"]] <- results(dds.for.analysis, contrast = contrasts[[i]])
}

Principle Component Analysis

The Bioconductor package PCAtools is used to perform principle component analysis on regularized log transformed count data. Scree plots show principle component numbers on the x-axis, and their respective eigenvalues on the y-axis.

The third graph plots experimental metadata against a number of PCs and their gene loadings to visualize the agreement with the gene expression pattern of each condition for each PC.

No Regression

Replicate Regressed Out

strain_hrde1_vs_wt

DEG Report

Ontology Report

Transfering WORMBASE gene IDs to ENTREZID:

563 gene IDs were not transfered from WORMBASE to ENTREZID 
14695 gene IDs were successfully transfered from WORMBASE to ENTREZID 

Number of highly variable genes identified (with | L2FC | > 0.8 and padj < 0.1) is:

322

Number of highly upregulated genes identified (with L2FC > 0.8 and padj < 0.1) is:

247

Number of highly downregulated genes identified (with L2FC < -0.8 and padj < 0.1) is:

75

Dysregulated

Gene Ontology GSEA
protein dephosphorylation

dephosphorylation

ribonucleoprotein complex biogenesis

phosphorylation

male gamete generation

rRNA processing

rRNA metabolic process

RNA modification

mitotic DNA integrity checkpoint signaling

Reactome Pathway GSEA
Translation

Ubiquitin Mediated Degradation of Phosphorylated Cdc25A

p53-Independent DNA Damage Response

p53-Independent G1/S DNA damage checkpoint

G2/M Checkpoints

HATs acetylate histones

RUNX1 regulates genes involved in megakaryocyte differentiation and platelet function

DNA Double-Strand Break Repair

PKMTs methylate histone lysines

G1/S DNA Damage Checkpoints

mRNA Splicing

Cell Cycle

DNA Replication

mRNA Splicing - Major Pathway

RNA Polymerase I Promoter Escape

RNA Polymerase I Promoter Clearance

RNA Polymerase I Transcription

Transcriptional regulation by RUNX1

Processing of Capped Intron-Containing Pre-mRNA

Cell Cycle Checkpoints

DNA Replication Pre-Initiation

Epigenetic regulation of gene expression

Cellular Senescence

Regulation of TP53 Activity

Positive epigenetic regulation of rRNA expression

Upregulated

GO Over-representation
Reactome Pathway Over-representation

Table Reactome Pathway Over-representation Analysis of Upregulated Genes has no rows.

Downregulated

GO Over-representation

Table GO Over-representation Analysis of Downregulated Genes has no rows.

Reactome Pathway Over-representation

3. Weiser Analysis

Metadata information received from the tsv table:

##   strain replicate
## 1  hrde1      rep1
## 2  hrde1      rep2
## 3     wt      rep1
## 4     wt      rep2

Analysis Methods

Here, count data from the RNA-seq experiment is read in the form of a counts matrix. Each column holds data from one sample, and each row represents a gene, such that the i-th row and n-th column tells how reads of gene i were measured in sample n. The values received should be un-normalized counts of sequencing reads or fragments.

# # Read Count Matrix
counts_file <- trimws(sample_params$salmon_merged_gene_counts_file_path)
count.matrix <- read_tsv(counts_file)
count.matrix <- count.matrix %>%
  mutate(across(3:ncol(.), ~ as.integer(.x))) %>%
  column_to_rownames("gene_id")

gene.reference <- dplyr::select(count.matrix, 1)

count.matrix <- count.matrix %>%
  dplyr::select(-1)

Information entered as metadata describes the samples (columns) of the count matrix. This metadata is combined with the sample/column names from the count matrix so the accuracy of the metadata can be reviewed. Take time to review the table below.

# Design ColData Matrix
sample <- colnames(count.matrix)
colData <- data.frame(sample)

# Add Columns for Condition to colData
for (i in colnames(metadata)){
  colData[i] <- metadata[i]
}

colData <- column_to_rownames(colData, "sample")
##             strain  replicate
## hrde1_rep_1 "hrde1" "rep1"   
## hrde1_rep_2 "hrde1" "rep2"   
## wt_rep_1    "wt"    "rep1"   
## wt_rep_2    "wt"    "rep2"

Using the count matrix, column metadata, and user-inputed design formula (expressing the variables to be used in modeling), a DESeq data set is created.

The experimental variable and the associated control/experimental conditions are given by the user in “ref_level_cond” and “ref_level_value” in the sampleInfo file respectively.

Pre-filtering is performed, keeping only genes that have a count of at least 10 in a minimum number of samples. The minimum number of samples is decided by calculating the number of times the reference level value (control condition) is listed in the metadata with the assumption that experimental conditions will be repeated the same number of times.

Standard differential expression analysis is performed with the DESeq function, and regularized log transformation (rlog) transforms count data to a log2 scale for PCA.

# Set Reference Level
reflevelcond <- trimws(sample_params$ref_level_cond)
reflevelvalue <- trimws(sample_params$ref_level_value)
# Run DESeq2 analysis
deseq_result <- runDESeq2Analysis(
  count.matrix = count.matrix,
  colData = colData,
  design = sample_params$design,
  ref_level_cond = reflevelcond,
  ref_level_value = reflevelvalue,
  metadata = metadata
)

dds <- deseq_result$dds
rld <- deseq_result$rld

If input data is designated to be batch corrected, the ComBat_seq() function from the SVA Bioconductor package is used. The batch (condition) to be regressed out is given in sampleInfo.csv as “batch_cond”, and the sample information for this condition is taken from the metadata.

The group argument of ComBat_seq() specifies biological covariates, whos signals should be preserved in adjusted data. All conditions (columns) from the metadata which are not to be regressed out are passed to the group argument. Differential expression analysis is otherwise performed as described above on batch-corrected counts.

if (as.logical(trimws(sample_params$batch_correct))) {
  # Replicate / Batch Correction
  batch <- trimws(sample_params$batch_cond)

  batch.correct <- ComBat_seq(
    counts = as.matrix(count.matrix),
    batch = metadata[[batch]],
    group = metadata[[colnames(metadata)[colnames(metadata) != batch]]]) %>%
    as.data.frame()

  # Run DESeq2 analysis on batch-corrected counts
  batch_deseq_result <- runDESeq2Analysis(
    count.matrix = batch.correct,
    colData = colData,
    design = sample_params$design,
    ref_level_cond = reflevelcond,
    ref_level_value = reflevelvalue,
    metadata = metadata
  )

  batch.correct.dds <- batch_deseq_result$dds
  batch.correct.rld <- batch_deseq_result$rld
  
  # Cleanup
  rm(batch, batch.correct, keep)
}

Reference level condition and value (ref_level_cond and ref_level_value respectively) inputs from sampleInfo.csv are used to create contrast terms to compare treatment samples with control samples. This relies on the design formula being formatted correctly to produce comparisons relevant to the experimental condition.

# # Find relevant contrasts
contrasts <- list()

# Factor values of experimental condition
pattern <- factor(metadata[[reflevelcond]])
comp <- levels(pattern)

# Remove control state from experimental condition vector
comp <- comp[comp != reflevelvalue]

# Create Contrasts
for (i in 1:length(comp)){
  contrasts[[i]] <- c(reflevelcond, comp[i], reflevelvalue)
}

Based on the number of experimental conditions (identified in the ref_level_cond column of the metadata) and the contrast terms comparing them to the experimental control (ref_level_value), differential expression results are extracted from the DESeq data set object.

# Find results from relevant contrasts
results <- list()

# Pull results from DESeq object with contrasts
for (i in 1:length(contrasts)){
  results[[i]] <- list()
  results[[i]][["DataFrame"]] <- results(dds.for.analysis, contrast = contrasts[[i]]) %>%
    data.frame() %>%
    rownames_to_column("gene_id") %>%
    mutate(gene_name = gene.reference[gene_id, 1], .before=baseMean) %>% # Add gene names to DESeq results matrices
    column_to_rownames("gene_id")
  
  results[[i]][["DESeqDataSet"]] <- results(dds.for.analysis, contrast = contrasts[[i]])
}

Principle Component Analysis

The Bioconductor package PCAtools is used to perform principle component analysis on regularized log transformed count data. Scree plots show principle component numbers on the x-axis, and their respective eigenvalues on the y-axis.

The third graph plots experimental metadata against a number of PCs and their gene loadings to visualize the agreement with the gene expression pattern of each condition for each PC.

No Regression

Replicate Regressed Out

strain_hrde1_vs_wt

DEG Report

Ontology Report

Transfering WORMBASE gene IDs to ENTREZID:

434 gene IDs were not transfered from WORMBASE to ENTREZID 
13753 gene IDs were successfully transfered from WORMBASE to ENTREZID 

Number of highly variable genes identified (with | L2FC | > 0.8 and padj < 0.1) is:

1379

Number of highly upregulated genes identified (with L2FC > 0.8 and padj < 0.1) is:

983

Number of highly downregulated genes identified (with L2FC < -0.8 and padj < 0.1) is:

396

Dysregulated

Gene Ontology GSEA
dephosphorylation

protein phosphorylation

phosphorylation

tRNA metabolic process

protein refolding

male gamete generation

ribosome biogenesis

RNA modification

aminoglycan metabolic process

mitochondrial gene expression

response to heat

mitochondrial translation

neutral amino acid transport

Reactome Pathway GSEA
Translation

Mitochondrial translation

Mitochondrial translation termination

Mitochondrial translation elongation

Mitotic Prophase

M Phase

Negative regulation of MET activity

Depolymerization of the Nuclear Lamina

Nuclear Envelope Breakdown

RMTs methylate histone arginines

Positive epigenetic regulation of rRNA expression

B-WICH complex positively regulates rRNA expression

RNA Polymerase I Promoter Escape

RNA Polymerase I Promoter Clearance

RNA Polymerase I Transcription

Assembly of the ORC complex at the origin of replication

Major pathway of rRNA processing in the nucleolus and cytosol

rRNA processing

rRNA processing in the nucleus and cytosol

SRP-dependent cotranslational protein targeting to membrane

Signaling by VEGF

VEGFA-VEGFR2 Pathway

L13a-mediated translational silencing of Ceruloplasmin expression

Formation of a pool of free 40S subunits

Eukaryotic Translation Initiation

Upregulated

GO Over-representation
Reactome Pathway Over-representation

Downregulated

GO Over-representation

Reactome Pathway Over-representation

Session Information

This section provides complete information about the R environment, package versions, and system configuration used to generate this report. This information is critical for reproducing the analysis.

R Version and Platform:

R version 4.4.0 (2024-04-24)
Platform: aarch64-apple-darwin20
Running under: macOS 15.5

Matrix products: default
BLAS:   /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib 
LAPACK: /Library/Frameworks/R.framework/Versions/4.4-arm64/Resources/lib/libRlapack.dylib;  LAPACK version 3.12.0

locale:
[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8

time zone: America/Los_Angeles
tzcode source: internal

attached base packages:
[1] stats4    stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
 [1] here_1.0.2                  RColorBrewer_1.1-3          lubridate_1.9.4             forcats_1.0.0              
 [5] purrr_1.1.0                 tidyr_1.3.1                 tidyverse_2.0.0             patchwork_1.3.2            
 [9] rrvgo_1.18.0                EnhancedVolcano_1.24.0      org.Ce.eg.db_3.20.0         AnnotationDbi_1.68.0       
[13] vsn_3.74.0                  sva_3.54.0                  BiocParallel_1.40.2         genefilter_1.88.0          
[17] mgcv_1.9-3                  nlme_3.1-168                GOSemSim_2.32.0             DESeq2_1.46.0              
[21] SummarizedExperiment_1.36.0 Biobase_2.66.0              MatrixGenerics_1.18.1       matrixStats_1.5.0          
[25] GenomicRanges_1.58.0        GenomeInfoDb_1.42.3         IRanges_2.40.1              S4Vectors_0.44.0           
[29] BiocGenerics_0.52.0         clusterProfiler_4.14.6      ReactomePA_1.50.0           readr_2.1.5                
[33] tibble_3.3.0                stringr_1.5.2               knitrBootstrap_1.0.3        PCAtools_2.18.0            
[37] ggrepel_0.9.6               ggplot2_4.0.0               htmltools_0.5.8.1           ashr_2.2-63                
[41] kableExtra_1.4.0            knitr_1.50                  DT_0.34.0                   dplyr_1.1.4                
[45] bookdown_0.44              

loaded via a namespace (and not attached):
  [1] fs_1.6.6                  enrichplot_1.26.6         httr_1.4.7                tools_4.4.0               R6_2.6.1                 
  [6] lazyeval_0.2.2            withr_3.0.2               graphite_1.52.0           gridExtra_2.3             preprocessCore_1.68.0    
 [11] cli_3.6.5                 textshaping_1.0.3         labeling_0.4.3            slam_0.1-55               sass_0.4.10              
 [16] SQUAREM_2021.1            S7_0.2.0                  tm_0.7-16                 askpass_1.2.1             mixsqp_0.3-54            
 [21] systemfonts_1.2.3         yulab.utils_0.2.1         gson_0.1.0                DOSE_4.0.1                svglite_2.2.1            
 [26] R.utils_2.13.0            invgamma_1.2              limma_3.62.2              rstudioapi_0.17.1         RSQLite_2.4.3            
 [31] treemap_2.4-4             generics_0.1.4            gridGraphics_0.5-1        crosstalk_1.2.2           vroom_1.6.5              
 [36] GO.db_3.20.0              Matrix_1.7-4              abind_1.4-8               R.methodsS3_1.8.2         lifecycle_1.0.4          
 [41] yaml_2.3.10               edgeR_4.4.2               qvalue_2.38.0             SparseArray_1.6.2         grid_4.4.0               
 [46] blob_1.2.4                promises_1.3.3            dqrng_0.4.1               crayon_1.5.3              ggtangle_0.0.7           
 [51] lattice_0.22-7            beachmat_2.22.0           cowplot_1.2.0             annotate_1.84.0           KEGGREST_1.46.0          
 [56] pillar_1.11.0             fgsea_1.32.4              codetools_0.2-20          fastmatch_1.1-6           glue_1.8.0               
 [61] ggfun_0.2.0               data.table_1.17.8         vctrs_0.6.5               png_0.1-8                 treeio_1.30.0            
 [66] gtable_0.3.6              cachem_1.1.0              xfun_0.53                 S4Arrays_1.6.0            mime_0.13                
 [71] tidygraph_1.3.1           survival_3.8-3            pheatmap_1.0.13           statmod_1.5.1             ggtree_3.14.0            
 [76] bit64_4.6.0-1             rprojroot_2.1.1           bslib_0.9.0               affyio_1.76.0             irlba_2.3.5.1            
 [81] colorspace_2.1-2          DBI_1.2.3                 tidyselect_1.2.1          bit_4.6.0                 compiler_4.4.0           
 [86] graph_1.84.1              xml2_1.4.0                NLP_0.3-2                 DelayedArray_0.32.0       scales_1.4.0             
 [91] affy_1.84.0               rappdirs_0.3.3            digest_0.6.37             rmarkdown_2.29            XVector_0.46.0           
 [96] pkgconfig_2.0.3           umap_0.2.10.0             sparseMatrixStats_1.18.0  fastmap_1.2.0             rlang_1.1.6              
[101] htmlwidgets_1.6.4         UCSC.utils_1.2.0          shiny_1.11.1              DelayedMatrixStats_1.28.1 farver_2.1.2             
[106] jquerylib_0.1.4           jsonlite_2.0.0            R.oo_1.27.1               BiocSingular_1.22.0       magrittr_2.0.4           
[111] GenomeInfoDbData_1.2.13   ggplotify_0.1.2           wordcloud_2.6             Rcpp_1.1.0                ape_5.8-1                
[116] viridis_0.6.5             reticulate_1.43.0         stringi_1.8.7             ggraph_2.2.2              zlibbioc_1.52.0          
[121] MASS_7.3-65               plyr_1.8.9                parallel_4.4.0            Biostrings_2.74.1         graphlayouts_1.2.2       
[126] splines_4.4.0             hms_1.1.3                 locfit_1.5-9.12           igraph_2.1.4              markdown_2.0             
[131] reshape2_1.4.4            ScaledMatrix_1.14.0       XML_3.99-0.19             evaluate_1.0.5            BiocManager_1.30.26      
[136] tzdb_0.5.0                tweenr_2.0.3              httpuv_1.6.16             openssl_2.3.3             polyclip_1.10-7          
[141] gridBase_0.4-7            ggforce_0.5.0             rsvd_1.0.5                xtable_1.8-4              reactome.db_1.89.0       
[146] RSpectra_0.16-2           tidytree_0.4.6            later_1.4.4               ragg_1.5.0                viridisLite_0.4.2        
[151] truncnorm_1.0-9           aplot_0.2.9               memoise_2.0.1             timechange_0.3.0         

Package Versions:

                Package           Version 
bookdown        "bookdown"        "0.44"  
clusterProfiler "clusterProfiler" "4.14.6"
DESeq2          "DESeq2"          "1.46.0"
dplyr           "dplyr"           "1.1.4" 
EnhancedVolcano "EnhancedVolcano" "1.24.0"
ggplot2         "ggplot2"         "4.0.0" 
org.Ce.eg.db    "org.Ce.eg.db"    "3.20.0"
PCAtools        "PCAtools"        "2.18.0"
ReactomePA      "ReactomePA"      "1.50.0"

Analysis Parameters:

Report generated: 2025-11-26 16:06:33.655144
Working directory: /Users/masonmatich/Documents/research_labs/projects/bulk_RNA_seq_workflow